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A  GENERAL  LEAST  SQUARES  SOLUTION  FOR  SUCCESSIVE  INTERVALS 

Gertrude  W.  Diederich,  Samuel  J.  Messick, 
and  Ledyard  R  Tucker 

Abstract 

A  general  least  squares  solution  for  successive  intervals  is  presented, 
along  with  iterative  procedures  for  obtaining  stimulus  scale  values,  dis- 
criminal  dispersions,  and  category  boundaries.,  Because  provisions  for 
weighting  were  incorporated  into  the  derivation,  the  solution  may  be  applied 
without  loss  of  rigor. to  the  typical  experimental  matrix  of  incomplete  data, 
i.e.,  to  a  data  matrix  with  mi s s i ng-pntr i« %  as  well  as  to  the  rarely  occur¬ 
ring  matrix  of  complete  data.  The  use  of  weights  also  permits  adjustments 
for  variations  in  the  reliability  of  estimates  obtained  from  the  data.  The 
computational  steps  involved  in  the  solution  are  enumerated,  the  amount  of 
labor  required  comparing  favorably  with  other  procedures.  A  quick,  yet 
accurate,  graphical  approximation  suggested  by  the  least  squares  derivation 


is  also  described. 


A  GENERAL  LEAST  SQUARES  SOLUTION  FOR  SUCCESSIVE  INTERVALS 


Since  Thurstone  first  developed  the  scaling  method  of  successive 
intervals,  it  has  appeared  in  several'  essentially  identical  forms  under 
various  names,  such  as  absolute  scaling  (15),  equal  discriminability 
scaling  (6),  and  graded  dichotomies  (l).  The  procedure  was  first  pub™ 
lished  as  a  psychological  scaling  method  by  Saffir  (l4),  the  basic 
rationale  having  been  previously  presented  by  Thurstone  in  his  absolute 
scaling  of  psychological  tests  (15,  18). 

Experimental  Procedures  for  Successive  Intervals 

The  experimental  procedure  for  the  method'  of  successive  intervals 
requires  n  stimuli  to  be  sorted  into  (  k  +  1  )  categories  on  some 
attribute  continuum.  The  categories  are  usually  ranked  so  that  a  stimu 
lus  placed  in  category  £  is  Judged  to  have  a  higher  psychological 
scale  value  and,  in  some  sense,  "more  of"  the  attribute  in  question 
than  a  stimulus  placed  in  category  (  g  -  1  ) .  Analogously  to  Cases  I 
and  II  of  the  Law  of  Comparative  Judgment  (17),  the  sorting  procedure 
may  be  repeated  N  times  by  the  same  person  or  performed  once  by  each 
of  N  different  people. 

This  procedure  yields  a  frequency  distribution'  for  each  stimulus 
over  several  of  the  categories  on  the  attribute  continuum,  i.e.,  it 
yields  the  number  of  times,'  f  ,  that  the  ith  stimulus  was  placed 
in  the  gth  category.  These  frequencies  provide  the  raw  material  for 
the  analytical  procedure  of  successive  intervals,  which  is  an  attempt 
to  account  for  the  data  by  a  single  psychological  scale.  The  basic 
consideration  is  whether  or  not  these  frequency  distributions  can  be 
simultaneously  converted  to  a  common  distribution,  allowing  unequal 
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means  and  variances,  on  the  same  base  line.  The  means  of  the  converted 
distributions  would  then  correspond  to  stimulus  scale  values  and  the 
standard  deviations  to  what  Thurstone  has  called  "discriminal  diaper* 
sions"  (l6) .  Scale  values  for  category  boundaries  are  also  obtained 
from  the  method  of  successive  intervals,  thus  permitting  estimates  of 
the  size  of  categories  rather  than  assuming  them  to  be  equal  as  in 
the  method  of  equal-appearing  intervals  (19)* 

Solutions  to  the  Scaling  Problem 

Successive  intervale  solutions  for  the  n  stimulus  values  have 
been  suggested  by  Saffir  (l4),  Guilford  (9),  Mosier  (12),  Bishop  (2), 
Attneave  (l).  Garner  and  Hake  (6),  Edwards  (3),  Gulliksen  (10),  and 
Rimoldi  (l3).  Some  of  these  articles  also  offer  solutions  for  the 
n  discriminal  dispersions  and  the  k  category  boundaries.  The 
procedures  vary  in  computational  routine  and  with  respect  to  certain 
restricting  assumptions,  but  they  are  essentially  equivalent.  These 
procedures  involve  obtaining  the  proportion,  p.  ,  of  times  a 

J-O 

stimulus  i  was  placed  in  category  g  .  The  values  are  then 

emulated  to  give  the  proportion  of  times  stimulus  i  was  placed 
below  the  gth  category  boundary,  t  ,  and  these  cumulative  pro- 
portions  are  usually  converted  into  normal  deviate  values,  z ^  . 

Various  successive  intervals  solutions  presented  in  the  literature- 
have  used  normal  curve  transformations  .or  other  similar  transforma¬ 
tions  such  as  the  logistic.  Any  similar  function  giving  a  one-to-one 
correspondence  between  p^  and  z ^  could  be  used  (see  10). 


The  normal  deviate  value  obtained  from  such  a  transformation  may  be 
expressed  as  follows: 


t  -  mJ 


ig  s, 


where  z^  *  the  normal  deviate  value  corresponding  to  a  cumulated  proportion, 

t  .«  the  upper  boundary  of  the  gth  category, 
g 

m^  =  the  scale  value  of  stimulus  i  ,  and 

s^  «s  the  discriminal  dispersion  for  stimulus  1  . 

The  category  boundary  can  then  be  expressed  as 

*g  -  »i  +  Vig  •  (2) 


This  equation  is  what  Torgerson  calls  a  special  case  of  the  Law  of 

Categorical  Judgment  (20 ).  Algebraic  solutions  for  m  ,  s  ,  and  t 

11  g 

can  then  be  obtained  from  this  relationship  by  arbitrarily  choosing  one 

\ 

of  the  s^  values  as  a  unit  and  one  of  the  m^  values  or  their  average 
as  an  origin. 

Gulliksen  (lO)  derived  an  explicit  least  squares  solution  for  m^  , 

•s  ,  and  t  by  minimizing  the  following  error  term: 
i  g 


= $  k  k* + v*.  ■  v1 


where  b  is  an  arbitrary  scale  factor. 


The  following  restrictions,  which  fix  an  origin  and  a  unit,  were  at¬ 
tached  to  the  function  to  be  minimized  by  Lagrange  multipliers: 
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the  "incomplete  overlap"  situation  could  be  easily  handled  without 
loss  of  rigor  by  assigning  weights  of  zero  to  the  missing  entries . 

As  it  turned  out,  the  present  iterative  solution  also  involves  a  com¬ 
putational  routine  that  is  not  excessively  laborious. 

The  general  strategy,  then,  of  the  present  least  squares  deriva¬ 
tion  is  to  solve  implicitly  for  m^  ,  s^  ,  and  t  in  terms  of  each 
other  and  then  to  set  up  rules  for  iterating  to  convergence  values. 
The  derivation  proceeds  as  follows : 


**  ?ig 
from  stimulus 


represent  an  estimate  of  t 

i  if  in,  and  s,  were  known 
—  i  i 


that  would  be  obtained 


+  s  z 
i  ig 


(6) 


Let  e. 
ig 


represent  the  error  made  by  estimating 


t 


g 


in  this  fashion: 


m.  +  s . z . 
i  i  ig 


-  t 


g 


(7) 


Since  the  error  term  to  be  minimized  must  contain  provisions  for 
weighting,  let  it  be  written  as 


.  n  k 
E  =  E  E 

i=l  g=l 


2 

v.  e, 
ig  ig 


y 


(8) 
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(12) 
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These  restrictions  generalize  for  the  weighted  case  the  definitions  used 
hy  Gulliksen  in  the  unweighted  case  (see  equations  4  and  5)-  Since  the 
t-scale  is  determined  only  within  a  linear  transformation,  a  and  b 
may  be  set  at  any  values  desired,  e.g.,  a  convenient  possibility  for  the 
origin,  a  ,  might  be  zero  and  for  the  scale  unit,  b  ,  which  must  bb- 
positive,  might  be  unity. 

Using  two  Lagrange  multipliers  y  and  2\  ,  the  restrictions 

setting  origin  and  unit  may  be  included  in  the  error  term  as  follows: 


«  *  h  Z  S  »lK  (■»!  +  Vig  -  ‘  )2  S  w  -  Va) 

b  i«l  gal  lg  lg  g  g»l  g  i=l  g 


1c  n 

-  7  [  E  Lw.  -  W(a2  +  b2)]  .  (13) 

g«l  8  i  6 


Except  for  the  weights,  w  ,  equation  13  is  identical  to  the  term 

lg 

minimized  by  Gulliksen  (see  equations  3;  and  5).  His  solution,  then, 

is  the  special  case  of  the  present  one  in  which  all  of  the  weights  are 

k  n 

equal  to  unity  and  Ew.  *  k  and  Ew,  *  n  .  Because  of  this  restriction 

g  lg  i  le 

to  unit  weights,  only  data  with  complete  overlap  can  be  considered.  Equa¬ 
tion  13  is  also  similar  to  the  term  minimized  by  Tucker  (22)  in  developing 
a  least  squares  solution  to  the  normal  ogive  model  for  categorical  data, 
which  is  formally  equivalent  to  the  successive  intervals  situation  (see  21, 
Chapter  13 )»  Tucker's  solution,  like  the  present  one,  involves  weights 
and  is  iterative,  but  instead  of  minimizing  the  sum  of  squared  differences 
between  theoretical  and  estimated  t-values,  he  minimized  the  sum  of  squared 


differences  between  theoretical  and  observed  z~values. 
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The  differentiation  of  Q  with  respect  to  each  of  the  m^  in  turn 
yields  the  n  equations 


1  E  w.  (m.  +  s  z  -  t  )  (+l)  (i«l...n) 

2  m±  b2  ,  ig  i  i  ig  g'  '  '  ' 


(14) 


Expanding  and  setting  the  partial  derivative  equal  to  zero, 


m.  Ew.  +  s.  Ew.  z.  -  Ew.  t  =  0 
1  g  iS  i  g  16  IS  g  ie  S  ■ 


(15) 


The  solution  for  m.  can  now  he  written  as 


Ew .  t  -  s .  Ew .  z . 
ig  g  l  ig  ig 

m.  =  * - r - S - 

l  k 


Ew 

g 


Ig 


(16) 


Q  is  now  differentiated  with  respect  to  each  s^  in  turn  to  yield 
the  n  equations 


1  a  i_ 


a  ST  =  75  ^igK  +  Vig  -  <1-1-  •  •”> 

i  h  g 


(17) 


Expanding  and  setting  the  partial  derivative  equal  to  zero, 


m.  Ew,  z,  +  s.  Ew.  z.  -  Ew.  t  z.  =0 
i  g  ig  ig  i  g  ig  ig  ig  g  ig 


(18) 


Substituting  the  value  of  from  equation  1 6  we  have 


(Ew  z  )(Ew  t  ) 
g  ig  ig  g  ig  g 


k 

si(Ew 


k 

Ew 

g 


z  ) 

igZig>  k  k 

■“ -  +  s.  Ew.  z.  -  Ew.  t  z.  ■  0 

*  i  g  ig  ig  g  ig  g  ig 


ig 


Ew 

g 


ig 


(19) 


Solving  for  s^  , 


(figVig)(fig>  ■  (fig'igXfigV 
(fig2ig)<fig)  •  (figzig)2 


(20) 


For  the  purpose  of  parenthetical  comment  upon  the  form  of  , 

equation  20  can  be  rewritten  in  the  following  manner; 


Ew.  (t  -  t.  )(z.  -  z  ) 

R  ig  g  i  ig  i 

k  2 

Ew,  (z,  -  z.  ) 

g  Ig  ig  i 


(21) 


where 


EW. 

K  lg  lg 
- 


Ew 


ig 


and 


Ew.  t 

rr  lg  8 

t.  «.  & - 

i  k 


Ew 


ig 


It  is  apparent  from  the  form  of  equation  21  that  s^  is  the  slope  of  a 
regression  line.  It  Is  the  coefficient  for  the  regression  of  t  on  z 
This  immediately  suggests  a  graphical  representation  of  the  data,  which 
will  be  presented  in  a  later  section.  It  is  also  interesting  to  note  that 


eS.'*'- 


. r~'  r 
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Tucker 's  solution  (22),  which  minimized  the  sum  of  squared  errors  in  the 
z  direction  instead  of  in  the  t  direction,  involves  the  other  regres¬ 
sion — the  regression  of  z  on  t 

Utilizing  the  value  of  s^  obtained  in  equation  20,  it  is  also 
possible  to  rewrite  the  f ormula  for  m^  in  terms  of  t  as  follows : 


(Ew.  t  )(Ew.  z?  )  -  (Ew.  t  z  )(Ew  z.  ) 
_  ig  g'\  ig  ig  v  ig  g  ig/v  ig  ig' 

n,  _  _a _ a _ a _ I _ a _ 

i  k  „  k  k 


(22) 


(fig4)(fi8>  -  (fVi/ 


1 


.  f 


As  will  be  seen  in  a  later  section,  this  is  the  form  of  m^  which  it  will 

be  convenient  to  use  in  computational  routines . 

Differentiating  Q  with  respect  to  each  t  in  turn  yields  the  k 

S 

equations 

1  rio  1  n  n  n 

-  -rr —  =  — ~  Ew  (m,  +  s ,  z .  -  t  )(-l)  +  \  Ew,  -  yt  Ew 

2  dT  >,2  ^  igv  i  i  ig  gM  '  A  ig  '  g  1  ig 


g  b  i 


(g=l- • -k) 


Expanding  and  setting  the  partial  derivative  equal  to  zero, 


n  n  2.  n  2  n 

t  Ew.  -  Ew.  (m.  +  s , z .  )  +  b  \  Ew.  -  b  yt  Ew.  =  0 

"  ■*"  -1  4  4~/  i  ig  '  g  ^  ig 


W  .  -  Ljfi  j  1  Q  *  ,  j 

S  ±  ig  i  ig  1  i  ig 


Define 


n 


+  v±g> 


v  5 

g 


n 

Ew 


ig 


(23) 


(24) 


(25) 


1 
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n 


Rearranging  the  terms  of  equation  2k,  dividing  both  sides  by  Ew.  ,  and 

i  g 

substituting  the  definition  of  v  from  equation  25,  we  may  write 

g 


(l  -  b2r)t-  -  v  -  b2*. 
g  g 


(26) 


The  solution  for  t  can  now  be  written  as 

g 


v  -  b\ 

t  =  _S _ 

6  1  -  b27 


(27) 


Summing  equation  2k  over  g  and  utilizing  the  definition  of  equation 
9,  we  may  write 


kn  o  P 

Wa  -  EEw.  (m.  +  s.z.  )  +  Wb  A.  -  Wab  7  =  0 
^g  -*•  1  ig 


(28) 


kn 


Consider  the  term.  EEw.  (m.  +  s.z.  )  .  By  interchanging  the  order  of 

gi  ig  i  i  ig 

summation  and  by  inserting  the  values  of  s^  and  m^  given  in  equations 
20  and  22,  respectively,  this  term  may  be  written  as 


k  1C  1C 

.  .  (Ew.  t  )[(Ev.  z2  ) (Ew  )  -  (Ew,  z,  )2] 

n  k  k  n  v  ig  gnv  ig  igM  ig'  v  ig  ig' 

Z(m,  Ew  +  s  Ew,  z,  )  ■  E  [-“ - r — = - r — ® - r — ® - 

'  i  _  ig  i  ig  ig'  k  „  k  k 


g 


g 


1 


(a -  (»Vlg)S 


g  18  18  g  18 


nk 

=  EEw  t 

ig 18  8 


(29) 


f 
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Utilizing  the  definition  of  an  origin  given  in  equation  $),  the  term  can  be 
further  simplified  to 


nk  nk 

EEw.  (m,  +  s.z.  )  =  EEw  t  »  Wa 
ig  ig  i  i  ig  ig  ig  g 


Now,  equation  28  can  be  written  as 


Wb2X  *  Wab2r  , 


(30) 


from  which 


X  =  a7 


(31) 


It  should  be  noted  in  passing  that  equation  30,  in  terms  of  the  v  of 
equation  25,  indicates  that  the  weighted  mean  of  v  is 

a 

k  n 


Ev  Ew. 

-  g  g  i  ig 
v  -  -  -  a 


EEw 

gi 


(32) 


ig 


If  the  above  value  of  X  is  substituted  into  equation  26  and 
2 

(1  -  b  y)a  is  subtracted  from  both  sides  of  the  equation,  then  equation 
26  becomes 


(l  -  b  7 )(tg  -  a)  -  vg  -  a 


(33) 


n 


Squaring  both  sides  of  equation  33,  multiplying  through  by  Ew  ,  and 

i  6 

summing  over  g  ,  we  may  write 


o  o  ^  9  ®  ^  o  ** 

(1  -  b  7)  E(t  -  a)  Ew  -  E(v  -  a)*  2Y 
g  g  i  g  g  S 


ig 


(3^) 
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Utilizing  the  definition  of  a  scale  unit  in  the  form  given  in  equation  12, 
we  may  write  equation  jk  as 


p  p  o  **■ 

(1  -  b  7)  Wb  =  E(v 


?  n 

a)  2Wj 


Using  the  value  of  \  found  in  equation  31,  the  solution  for 
given  in  equation  27  may  he  written  as 


t 

6 


(37) 


2 

Substituting  the  value  of  (  1  -  b  7  )  from  equation  J>6,  the  solution  for 

t  becomes 
g 


v  =  a 


7  may  also  be  represented  in  terms  of  the  sum  of  squares  of  errors 
as  follows :  Equation  8  may  be  written  as 
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nk 


ig 


2  nk 

iZig  ‘ 

t  )  *  EEw,  (m.  +  s . z 

s'  lg  is'  1  1  : 

n 

k  „  n 

;  figK  +  Vig>  +  f  g  fig 

(39) 


Consider  the  first  term  of  the  above  expression: 


nk 


nk 


n  k 


fV-i +  Vig>  ■  +  Bizig)(mi  -  8izig)  -  fi  fig(mi +  si2ig) 

Ig  .  1  g 


(40) 


+  Es.  Ew,  z  (m,  +  s.z,  ) 

±  i  ig  ig  i  i  ig 

Substituting  into  the  last  term  of  equation  40  the  values  of  s^  and  m^ 

from  equations  20  and  22,  respectively,  and  simplifying>  and  remembering 

k  k 

that  Ew  (m.  +  s  z  )  was  found  to  be  equal  to  Ew  t  in  equation  29, 
g  ig  i  i  ig  g  ig  S 

we  may  rewrite  equation  40  as 


nk 

EEw, 

ig 


n  k 


n  k 
g 


i8(ni  *  Vig>  -  ft  fig^ +  fi  figVig "  fg.figKy  Vig>  • (41) 


k  n 
Et 
g 


Equation  39  niay  now  be  written  as 


k  n  k  n 

b  E  =  -  Et  Ew.  (m.  +  s.z.  )  +  Et  Ew. 

g  8 1  ig  i  1  18  g  6 1 18 


(42) 


Utilizing  the  definition  of  a  scale  unit  from  equation  10,  equation  42  becomes 

n 

b2E  =  W(a2  +  b2)  -  Etg  Ewig(mi  +  Sizig)  .  (43) 
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Ihe  size  of  the  error  term  at  the  solution . may  now  be  evaluated  by- 

substituting  for  t  in  equation  43  its  value  from  equation  37>  and  by 
n  8  n 

noting,  that  Ew  (m.  +  s  z.  )  ■  v  Ew  from  equation  25.  Equation  43 
^  18  i  i  IS  8  18 

then  becomes 


?  op  -I  k  p  n  k  n  kn 

b  E  -  W(a  +  b  )  -  - =-*-  [  Ev  Ew4  -  a  Ev  Ew4  ]  -  a  Ev  Ew4 

1  -  1%  (‘i1'  8*il*  g  e  1  le 


•  (W) 


o  k  n 

In  equation  32  it  was  seen  that  Ev  Ew  *  Wa  ,  so  equation  44  may  be 

8  g  i  g 

written  as 


2  2  2 
b  E  -  Wa  +  Wb 


.  k  „  n  P  „ 

- =-*-  [  Ev  Ew.  -  Wa  ]  -  Wa 

1  -  *27  g  g  i  ig 


k  2  n  2  k  2  n 

Rewriting  [  Ev  Ew  -  Wa^ ]  as  E(v  -  a;  Ew  ,  and  noting  the  value 

g  8  i  lg  g  8  i  8 

of  this  term  in  equation  35>  we  may  write  equation  45  as 


b2E  -  Wb2  -  (1  -  b27)Wb2 


The  error  term  may  now  be  expressed  as 


E  -  Wb2/ 


Ihe  Iterative  Procedure 


Since  the  t- sc ale  of  successive  intervals  is  determined  only  with  a 
linear  transformation,  the  origin,  a  ,  and  the  scale  unit,  b  ,  may 
be  set  at  any  values  desired.  Ihe  values  most  convenient  for  computational 
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routines  are  a  =  0  and  b  *  1  .  With  such  a  placement  of  origin  and 

unit,  the  restrictions  given  in  equations  9  and  10  may  be  restated  as 

k  n 

Et  Ew  *  0  and  (48) 

b8i1b 

k  n 

Et  Ew  «  W  ,  respectively.  (49) 

g  g  i  g 


The  formula  for  category  boundaries  given  in  equation  38' may  also  be 
rewritten  in  terms  of  this  origin  and  unit  as  follows; 


t  = 


JL 


k  n 
rt  Ev  Ew . 
w  g  g  i  lg 


(50) 


Now,  equations  20,  22,  25,  and  50  may  be  used  to  set  up  an  iterative 

procedure  to  obtain  convergence  values  for  s.  ,  m.  ,  and  t  .  If 

0 

some  initial  estimate,  tgl  ,  of  the  category  boundaries,  ,  were 

available,  equation  20  could  be  solved  to  obtain  initial  estimates,  s^ 
of  the  discriminal  dispersions,  s^  .  The  initial  t-estimates,  t  ^  , 

would,  of  course,  have  to  be  converted  to  meet  the  restrictions  of  equa¬ 
tions  48  and  49. 

In  other  words,  if  some  set  of  k  numbers,  v^  ,  is  available  as 
possible  estimates  of  the  category  boundaries,  before  they  are  used  in 
the  above  solution,  they  must  first  be  converted  to  meet  the  restrictions 
of  equations  48  and  49-  This  can  be  done  as  follows: 


M _ 
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gl 


,  where 


I 


^  "  *  f  4  K 


and 


cr  = 
V1 


M 


.  k  n 

sfvV 


(51) 


Initial  estimates  of  s^  can  then  he  obtained  from  equation  20  as 
follows : 


k  k  k  k 

(Ew  t  z  )(Ew.  )  -  (Ew.  z.  )(Ewj  t  .  ) 
„  ig  gl  ig/v_  ig'  ig  lgM_  ig  gl' 

3  K  —a - B_ _ B _ B _ 

il  k  „  k  k  _ 

(fi/igHfig)  -  <figV 


(5a) 


If  a  subscript  _a  is  introduced  to  indicate  the  Ofth  cycle  in  the  itera¬ 
tive  procedure,  equation  52  can  be  rewritten  as  a  formula  for  the  crth 
estimate  of  s. 


’i  * 


k 


(Ew  t  z  )(Ew  )  -  (Ew.  z  )(Ew.  t  ) 
ig  ga  ig'v  ig'  '  ig  igM_  ig  ga  ’ 

S  *  — » - ■ - B - , - B - - - EL. 


ia 


(53) 


In  order  to  set  up  systematic  computational  routines,  it  would  seem 
desirable  at  this  point  to  define  the  following  coefficients : 


p 
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k 

Ew 


ig 


A,  - 


(54) 


(Ew  z  )‘ 

g 18 18 


Bi a 


Ew.  z, 

A 18  18 


(55) 


(Ew  z  )‘ 

g 18 18 


o,  a. 


v,  2 
Ew.  z. 

g  lg  lg 


(56) 


(fig^igXfig) 


(Ew  z  )‘ 
'  ig  ig 


Since  the  components  of  the  above  coefficients  are  obtainable  directly 
from  the  data,  they  are  the  same  for  all  cycles  of  iteration  and  need 
be  computed  only  once  for  the  entire  procedure. 

Equation  53  for  the  ath  estimate  of  the  discriminal  dispersions, 
s.^  ,  may  now  be  written  as 


a.  m  A.  Ew.  t  z. 

ice  i  ig  ga  ig 

o 


B.  2w  t 

i  g  ig  ga 


(57) 


Having  found  estimates  for  t  and  ,  the  ath  estimate  of  the 

scale  values,  m^  ,  may  now  be  obtained  from  equations  22,  5-5,  and  56 
as  follows : 


m.  =  C,  Ew  t  -  B  Ew  t  z 

ia  i  ig  ga  i  ig  ga  ig 


(58) 
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Now,  new  estimates  of 
equation  25: 


g 


may  be  obtained  from  a  formula  analogous  to 


vg(a+l) 


n 

Ew 

1 


igmia  + 


n 

Ew 


igSiazig 


(59) 


A  new  estimate  of  the  t-scale  may  now  be  found  by  using  equation  50  as 
follows : 


tg(a+l) 


vg(q+i) 


fvg(a+l) 


n 


(60) 


The  above  procedure  may  be  repeated  by  inserting  this  value  of 

■tg(a+l)  into  equation  57  to  obtain  s^(a+;j_)  >  which  in  turn  may  be 

used  to  obtain  (a+2_ )  and,  subsequently,  .  This  cycle  may 

then  be  Iterated  until  two  successive  estimates  of  t  are  as  similar 

g 

as  desired,  i.e.,  until  [  t^a+^  -  t^  ]  is  negligible. 

The  amount  of  scaling  error  at  any  given  cycle  of  iteration  may  be 
evaluated  from  equation  with  a  set  equal  to  zero  and  b  to  unity, 
as  follows : 


E. 


a 


w  - 


k 

Et 

g 


ga 


n 

Ew 

i 


ig(mia 


siazig^ 


(61) 


The  error  should  decrease  as  iteration  proceeds  until  at  convergence 


E  =  W 7  =  W(l 


k  p  n 

yr  Ev2  Ew,  ) 
“g'l'8 


6 

(62) 
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The  one  step  remaining  to  be  considered  before  the  above  iterative 
procedure  may  be  applied  in  practice  is  the  initial  estimation  of  the 
t-scale.  One  obvious  starting  point  might  be  a  set  of  equally  spaced 
numbers,  such  as  the  integers  from  1  to  _k_ ,  to  which  the  conversion  of 
equation  51  had  been  applied.  By  using  such  equally  spaced  t  values, 
a  set  of  "equal-appearing"  intervals  is  used  as  the  starting  point  for 
iteration  to  successive  intervals.  It  may  be  possible,  also,  to  increase 
the  rate  of  convergence  in  the  iterative  procedure  by  doubling  or  tripling 
the  difference  between  successive  t-estimates,  i.e.,  instead  of  using 

‘gta+l)  on  the  (a  +  trlal-  use  *g(a+l)  *  *gS  +  2t*g(a+l)  -  V  • 

A  cycle  or  two,  and  in  some  cases  perhaps  several  cycles,  may  be 

eliminated  from  the  iterative  procedure  by  using  a  computationally  simple 

linear  solution  for  tg  as  a  first  estimate.  One  of  the  simplest  methods 

for  estimating  t  has  been  suggested  by  Garner  and  Hake  (6)  and*by 
6 

Edwards  (5)  and  involves  averages  of  successive  differences  in  z_^ 

values.  Such  averages  are  estimates  of  (t  -  t  . )  provided  the  dis- 

6  6”-*- 

criminal  dispersions  may  be  assumed  equal.  Torgerson  (21 )  also  gives 

a  simple  algebraic  ratio  solution  for  t  which  does  not  require  equal 

S 

s^  .  Any  of  these  algebraic  solutions  may  be  used  to  obtain  initial 
estimates  for  the  iterative  procedure,  but  the  labor  involved  might  turn 
out  to  be  as  great  as  that  in  the  cycle  or  two  eliminated. 


Some  Weighting  Systems  Appropriate  for  Successive  Intervals  Data 


Some  comment  is  necessary  at  this  point  concerning  the  weights,  w  , 

^■8 

involved  in  the  above  least  squares  solution.  It  will  be  recalled  that 
the  only  restriction  placed  upon  the  choice  of  these  weights  was  that 


21 


0  and  w,  z  *=  0  when 

lg  lg 


If  £  equals  neither  zero  nor  one,  the  weights  may  be  set  at  any  values 
desired,  e.g.,  w ^  may  be  set  equal  to  unity  for  0  <  p  <  1 

However,  as  Urban  (2j)  hds  pointed  out,  it  is  a  principle  of  the 
method  of  least  squares  that  more  reliable  observations  should  have 
greater  importance  than  less  reliable  ones,  so  it  would  seem  reasonable 
to  choose  the  weights  in  such  a  way  as  to  take  account  of  differences 
in  the  reliability  of  the  z^  values .  Since  the  reliability  of  a  pro¬ 
portion  is  inversely  related  to  its  variance,  the  weights  used  for  such 

a  purpose  would  be  proportional  to  —  .  These  weights  would  also  be 

pq 

directly  proportional  to  the  information  available  from  the  observations, 
since  the  reciprocal  of  the  variance  may  be  Identified  with  quantity  of 
information  (5). 

Another  point  must  also  be  considered  in  selecting  weights  for 
successive  intervals  data.  As  reported  by  Guilford  (8,  p.  175);  "Muller 
argued  that  the  proportions  near  .50  should  be  weighted  more  than  the 
proportions  deviating  in  either  direction  from  .50. ..an  error  in  £ 
near  .50,  where  £  is  changing  at  its  maximum  rate,  as  compared  with 
the  change  in  (  z  ),  is  not  nearly  so  serious  as  at  the  extremes  where  a 
slight  error  in  £  is  reflected  in  a  large  error  in  (  z  )."  Milller,  then, 
waa  advocating -a  system  which  would  weight  directly  in  proportion  to  the 
rate  of  change  of  £  with  respect  to  i  .  These  weights  turn  out  to 
be  the  ordinates,  X  ,  of  the  normal  distribution  corresponding  to 
proportions,  £  ;  a  recent  derivation  of  these  weights  is  given  by 
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Finney  (k).  In  practice  both  weighting  conditions  are  usually  combined 

X2 

to  produce  the  Muller-Urban  weights,  —  . 

It  can  also  be  shown  that  within  an  approximation  the  Muller-Urban 
weights  are  the  proper  values  for  weighting  normally  transformed  scores 
inversely  to  their  variance.  Consider  the  following  approximate  relation¬ 
ship  stated  by  Kendall  (ll,  p.  206): 


var  p  =  ,(|j^)2  var  z 


where  var  is  an  abbreviation  for  variance.  If  £  and  z  are  related 
by  the  unit  normal  distribution  function,  as  is  the  case  for  successive 
intervals,  ^  is  equal  to  X  ,  the  ordinate  of  the  unit  normal  curve 
corresponding  to  a  proportion,  p  .  Therefore, 


var  p  =  X  var  z 


var  z 


Thus,  in, weighting  inversely  to  the  variance  of  z  ,  the  weight  would 

X2 

be  proportional  to  —  ,  since  N  is  a  constant  in  the  successive 

intervals  situations. 

However,  it  is  possible  to  use  a  simpler  set  of  weights  than 

g 

X  /pq  ,  without  sacrificing  completely  the  differentiation  between 


reliable  and  unreliable  z ^  values.  For  instance,  one  possible  rule 
for  weighting  would  be  to  assign  zero  if  the  corresponding  proportion 
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1 

contained  less  than  some  specified  fraction,  —  ,  of  the  maximum  possible 

information  (corresponding  to  p  =  „5  )  and  unity  if  it  contained  more  than 


—  th  the  maximum  information.  Or,  all  :  z 


ig 


>  c  could  he  weighted  zero, 


and  all  J  z^  j  <  c  could  he  weighted  unity;  such  a  rule  with  a  value  of 
c  =  2  has  been  found  to  be  convenient  in  practice  (7,  8). 

The  use  of  a  simple  set  of  unit  and  zero  weights  also  simplifies  some 
of  the  procedures  involved  in  the  above  iterative  solution.  For  instance, 
if 


w.  = 
ig 


jo  for 

V 

■u 1 

(l  for  | 

Kl 

or  alternatively  stated,  if 


w 


ig 


0  for  F  <  c  or  F.  >  cc 
ig  1  ig  2 


,1  for  c.,  <  F.  <  c^ 
1  -  ig  -  2 


*  C1  <  C2  ' 


the  weights  can  be  applied  simultaneously  with  the  conversion  to  z 
? 

ig 


i*e>;  F  <  c,  or  F.  >  c_  can  be  converted  to  zero,  while  c.  <  F.  <  c_ 
1  ig2  ’  1—  ig—  2 


ig  ' 
ig 


are  converted  to  the  corresponding  normal  deviates,  z^  .  With  this  kind 
of  procedure,  the  deviate  values  available  for  manipulation  have  already 
had  the  weights  applied;  this  would  greatly  simplify  subsequent  computations. 


Summary  and  Illustration  of  Analytical  Procedures 

Ihe  analytical  procedures  involved  in  the  above  least  squares  solu¬ 
tion  for  successive  intervals  will  now  be  summarized,  and  an  errorless 
numerical  example  will  be  used  to  illustrate  the  computational  routine. 


1.  The  experimental  method  of  successive  intervals  yields  the 
category  (l  to  k+l)  into  which  each  of  n  stimuli  was 
placed  hy  each  of  N  individuals.  These  data  may  he  sum¬ 
marized  into  an  n  X(k+l)  table,  the  cell  entries  of  which, 
f^  ,  represent  the  number  of  times  the  ith  stimulus  was 
placed  in  the  _gth  category.  By  cumulating  the  frequencies 
in  each  row  of  this  table  so  that  each  entry  now  represents 
the  number  of  times  the  ith  stimulus  appeared  below  the 

gth  category  boundary,  t  ,  a  set  of  cumulated  frequencies, 

S 

F.  ,  is  obtained,  which  can  be  considered  to  be  the  start - 
ig  ’ 

ing  point  for  successive  intervals  analysis. 

2..  The  cumulated  frequencies  are  then  converted  into  proportions, 

P.  ,  and  then  to  normal  deviate  values,  z,  .  For  the 

is  is 

purpose  of  illustrating  computational  procedures,  consider 

the  set  of  z^g  values  presented  in  Table  1.  The  four  scale 

values,  m^  ,  four  discriminal  dispersions,  s^  ,  and  three 

category  boundaries,  t  ,  which  exactly  fit  these  z.  values 

6  ^*S 

under  the  restrictions  of  equations  48  and  4-9  are  also  given  in 
Table  1.  Knowing  a  "true"  set  of  scale  values  and  category 
boundaries,  the  convergence  of  the  above  iterative  solution 
may  be  illustrated. 

3.  A  set  of  weights,  w^^  ,  and  an  initial  estimate,  t^  ,  of 

the  category  boundaries  must  now  be  determined.).. ’For/ the  .present 
example,  it  was  decided  to  use  the  weights  given  in  Table  1; 
they  were  assigned  so  that 


r 


0  for 
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Jig! 


3.0 


wig  =  “S1  for  3.0>  jzigj  *2,0 


V2  for  |zig|<2*° 

It  was  also  decided,  to  use  an  equally-spaced  scale  as  a 

first  estimate  of  t  .  Accordingly,  using  the  integers  1, 

g 

2,  and  3  as  v  ,  ,  the  conversion  of  equation  51  produced  as 

g.L 

t  ^  the  values  -1.16422,  .15523,  1.47469,  It  should  he  noted 

that  if  the  "true"  t  values  given  in  Table  1  were  used  as 

g 

first  estimates  in  the  present  iterative  procedure,  they  would 
be  exactly  reproduced  at  the  end  of  one  cycle. 

4.  Now,  the  coefficients  ,  and  Ch  may  be  computed 

according  to  equations  54,  55,  and  56,  respectively |  these 
values  are  presented  in  Table  2,  along  with  the  values  of 

k  k. 

fisV  ana  figVig  • 

a  0 

5.  Sufficient  information  is  now  available  to  solve  for  first 
estimates  of  the  discriminal  dispersions,  s^  ,  using 
equation  57  (see  Table  2). 

6.  Now,  first  estimates  of  the  scale  values,  nu^  ,  can  be 
obtained,  using  equation  58  (see  Table  2), 

7.  New  estimates,  tgg  ,  of  the  category  boundaries  may  now 
be  found  from  equations  59  and  60.  It  will  be  noted  that 

tgg  given  in  Table  2  is  closer  to  the  "true"  t-scale  than 
t  ..  was . 

S  k 

8.  In  order  to  iterate  this  solution,  new  values  of  Zw.  t  _ 

_  ig  g2 

k  ® 

and  ^wig^g2zig  raus"^  computed.  Using  these  values. 


"■13T 
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equations  57  and.  respectively,  may  be  solved  to  obtain 
s^g  and  m^g  .  Then,  equations  59  and  60  may  be  used  to 
obtain  t  _  ,  the  third  estimate  of  the  t-scale  (see  Table 

3).  It  should  again  be  noted  that  the  estimates  of  t  at 

g 

each  successive  cycle  are  approaching  closer  and  closer  the 

"true"  t  scale.  This  procedure  may  now  be  repeated  until 
g 

two  successive  t-estimates  are  as  similar  as  desired. 


A  Graphical  Successive  Intervals  Scaling  Procedure 


It  was  seen  from  equation  21  that  s.  is  the  regression  coefficient 
for  the  regression  of  t  on  z  .  This  suggests  a  graphical  solution 
to  successive  intervals,  which  will  be  summarized  below;  the  procedures 
to  be  presented  bear  some  similarity  to  the  graphical  methods  of  Mosier 
(12)  and  Garner  and  Hake  (6). 

1-3-  Steps  1  through  3  of  the  graphical  procedure  are  identical  to 
the  corresponding  steps  of  the  above  analytical  procedure.  In 
order  to  utilize  the  graphical  method,  a  first  estimate,  t  ^  , 

of  the  category  boundaries  must  be  available,  along  with  normal 
deviate  values  and  their  corresponding  weights. 

4.  The  estimated  t  ,  values  are  then  marked  off  as  the  ordinate 

gl 

of  a  graph  with  z  values  used  as  the  abscissa.  The  t  ^ 
values  are  horizontal  lines  that  hold  for  all  stimuli,  so 
several  plots  can  be  made  on  one  graph  (see  Figure  l).  For 


each  stimulus,  the  zig  values  are  plotted  at  the  appropriate 
t  ^  points,  i.e.,  for  stimulus  2  in  the  above  numerical 
example,  points  would  be  plotted  at  (  t  ^  =  -I.l64  ,  z  =  -.5  ), 

(  tgl  =  .155  ,  z  =  .5  ),  and  (  tgl  =  1.475  ,  z  =  2.0  ),  as 

illustrated  in  Figure  1.  Weights  can  be  applied  in  the  graphical 


F 
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procedure  by  clustering  around  each  point  a  number  of  dots  pro¬ 
portional  to  the  corresponding  weight.  A  straight  line  can  now 
be  fitted  to  the  points  by  eye,  giving  more  emphasis  to  those 
points  with  bigger  dot  clusters  in  determining  the  slope  of  the 
line . 

5».  The  equation  of  each  of  these  lines  can  be  written  as 
*61  ’  "ll  +  8ilZlg 


or,  more  generally,  as 


t  =  m.  +  s .  z. 
ga  ia  ia  ig 


m 


The  slope,  ,  of  each  line  is  the  0$hi;.  estimate  of  the 

discriminal  dispersion,  and  the  intercept,  ,  when  z  =  0 

is  the  otth  estimate  of  the  scale  value .  These  intercepts  and 
slopes  can  be  read  directly  from  the  graph,  but  they  need  not  be 
recorded  until  the  final  iteration. 

6.  In  practice,  the  straight  lines  fitted  to  the  plotted  values 
will  rarely  cross  every  point;  each  point  will  usually  deviate 
from  the  line  by  some  amount,  the  amount  of  this  deviation  in 
the  vertical  direction  representing  a  scaling  error  (see  Figure  l). 

The  vertical  projection  of  a  plotted  point  on  the  fitted  line 

A 

produces  another  point,  t  ,  which,  since  it  lies  directly  on 

-*•0 


the  line,  represents  a  theoretical  or  fitted  estimate  of  t 


g 


(see  Figure  l).  For  a  given  category  boundary,  there  are  n 

A 

fitted  estimates  t.  ,  one  for  each  stimulus.  If  the  ordinates 

ig  ’ 

Ay 

of  these  t.  ’  values  are  recorded,  weighted  averages  of  the 
^-6 
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ordinates  can  be  used  to  obtain  a  new  estimate  of  t  as  follows 

6 


n 


Zw.  t .  „ 
±  ig  iga 


n 

Zw 


-  v. 


g(Q£+l) 


(66) 


ig 


t  -  Vg(oi+l)  "  V(oi+l) 

g(a+l)  “  a 

V(a+1) 


(67) 


where 

^  k  n 

v(a+l)  =  W  g  vg(a+l)  ^wig 

and 

ffv(a+l)  =/i  l  |~vg(a+l)  "  y(a+l)]2  *wig 

The  only  value ,  then,  that  need  be  read  from  the  graphs  in 
going  from  one  iterative  cycle  to  another  is  iiga  •  The  slopes 
and  intercepts  corresponding  to  discriminal  dispersions  and 
scale  values  do'  not  need  to  be  recorded  until  the  final  iteration 
7.  This  new  estimate  of  t  may  now  be  plotted  as  the  ordinate  of 
a  graph  with  z  values  marked  off  as  the  abscissa.  The  cycle 
may  then  be  repeated  beginning  at  step  4  until  two  successive 
t-estimates  are  as  similar  as  desired. 

Summary 

A  general  least  squares  solution  is  developed  for  the  method  of 
successive  intervals  when  weights  are  assigned  to  each  error  term. 

These  weights  may  reflect  the  relative  amount  of  information  contained 
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in  the  corresponding  observations.  By  use  of  zero  weights,  this  method 
is  rigorously  applicable  to  matrices  involving  incomplete  data.  An 
iterative  computational  procedure  is  presented. 
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Grophicol  Solution  for  Numerical  Example 
Beginning  with  Equally  Spaced  t0( 
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TABLE  1 
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